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Abstract 



^ Let C be a sequence of multisets of subspaces of a vector space F^. We describe a 

practical algorithm which computes a canonical form and the stabilizer of C under 
the group action of the general semilinear group. It allows us to solve canonical form 
**«D problems in coding theory, i.e. we are able to compute canonical forms of linear 

codes, Fq-lincar block codes over the alphabet F^s and random network codes under 
^-H their natural notion of equivalence. The algorithm that we are going to develop is 

based on the partition refinement method and generalizes a previous work by the 
Q author on the computation of canonical forms of linear codes. 
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1 Introduction 



In this paper wc consider the canonization problem defined on group actions 
in the following sense: Let G be a group acting on a set X from the left. 
Furthermore, let C{G) be the set of subgroups of G. 

Problem 1 (Canonization). Determine a function 



Cane -.X^XxGx C{G) 

X ^ {CFg{x), TRoix), Stabc (x)) 
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with 



Va; G X,V(7 G G : CFg(x) = CFcigx) (1) 
\/xeX : CFg(x) = TRg(x)x (2) 
Vx G X : Stabc (x) = {(? G G | (?x = x} (3) 

The element CFg{x) is called the canonical form of x and the element TRg'(x) G 
G a transporter element. The element TRg'(x) is well-defined up to the mul- 
tiplication with the stabilizer Stabc (x) from the right. 

In the case of a finite group G the orbit Gx is finite as well. Hence, apply- 
ing an orbit-stabilizer algorithm and defining CF(x) := minGx already solves 
this problem. Our goal is to define Can^ in such a way that there is an algo- 
rithm with good practical performance to compute a canonical form. Indeed, 
Cane is implicitly defined via the result of the algorithm. We included the 
stabilizer computation to the canonization process since the Homomorphism 
Principle (Theorem [sj [S] , which we will apply as a key tool, must have this 
data available. 



In this work, we will provide a practical canonization algorithm for sequences 
of (multi-) sets of subspaces under the action of the semilinear group. It will be 
a natural generalization of the algorithm |1] , where the author solves the same 
problem in the special case that all occurring subspaces are one-dimensional. 
Since this is a question arising from coding theory, the algorithm was formu- 
lated using generator matrices of linear codes. In fact, it canonizes generator 
matrices of linear codes. Similarly, the present problem also has applications 
in coding theory as well, see Section |4} 

[TT] investigate the computational complexity of the code equivalence problem 
for linear codes over finite fields. They show that this problem is not NP- 
complete. On the other hand, it is at least as hard as the graph isomorphism 
problem. The later problem has been studied for decades, but until now there 
is no polynomial time algorithm solving it. Therefore, we can not expect to 
give a polynomial time algorithm solving our present problem. We therefore 
measure efficiency in terms of running times on selected non-trivial examples. 

The paper is structured as follows: The next section will describe general 
methods for providing practical canonization algorithms like the partition re- 
finement approach. The program nauty [TU] is a prominent example using this 
idea: it canonizes a given graph under the action of the symmetric group, i.e. 
the relabeling of vertices. In Section |3| we give a reformulation of the original 
problem such that we are able to use the partition refinement idea, too. The 
subsequent section deals with the origins of this problem from coding theory. 
Subsection 5.1 summarizes the necessary modifications of Section |2] in order 
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to canonize linear codes. In the following, we give the details of the canon- 
ization algorithm for sequences of subspaces and finish this work with some 
applications of the algorithm in Section |6] and a conclusion. 



2 General canonization algorithms 

This section surveys four principle attacks for the canonization of an object 
X G X under the action of G. It is a summary of [6]. Therefore, we will omit 
the proofs. 

2.1 The direct approach 

The first method is the most desirable. It is directly attacking the problem, 
which means that we understand the group action in such a way that we are 
able to define CFg{x) and the group element TRg(x) and to give a polynomial- 
time algorithm for its computation without making (explicitly) use of the 
group structure. 

Example 2. Let X be a totally ordered set. The symmetric g roup Sfi cLcts oil 
X" via 



7i{xi, ...,Xn) := (a^TT-Hi), • • • , a;^-i(n)), for all n G 5"^, (xi, . . . , x„) G X". 

We may compute a canonical form of a given sequence (xi, . . . , Xn) by lexico- 
graphically sorting the elements of the vector. It is easy to define a transporter 
element. The stabilizer of (xi, . . . , x„) is the subgroup of elements in Sn which 
interchanges equal entries of the vector. 

Example 3. Let ¥q be the finite field with q elements, where q = p^ for some 
prime p. The general linear group GLk{q) is the set of all invertible k x k- 
matrices with entries in Fg. It acts on the set F^^'^ of all k x r-matrices using 
the usual matrix multiplication from the left. We may define a canonical form 
for M G F^^*" using the reduced row echelon form RREF(M) of M. Gaussian 
elimination is a polynomial-time algorithm to compute the canonical form 
and a transporter element under this action. Furthermore, the stabilizer of M 
could be easily given using the stabilizer of the canonical form RREF(M). 

Similarly, a canonical form of M G F^^*" under the action of GLr{q) - given by 
[A, M) I—)- MA^ - could be defined to be the one in reduced column echelon 
form RCEF(M). 
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2.2 Homomorphism Principle 

Definition 4. Let G be a group acting on a set X and H another group 
acting on Y. A pair of mappings {6 : X ^ Y, (f : G ^ H) wliere is a group 
homomorphism is called a homomorphism of group actions if the mappings 
commute with the actions, i.e. 6{gx) = ip{g)9{x), \/ g E G,x E X. 

In the case that ip is the identity on G, i.e. G = H, we call the function 6 a 
G -homomorphism. If the action on the right is trivial, i.e. hy = y for all y eY 
and /i e if, we call the function 6 G-invariant. In this case we could always 
suppose that \H\ = 1. 

Theorem 5 (Homomorphism Principle, [H]). Let {6 : X ^ Y,(f : G ^ H) be 
a homomorphism of group actions, with surjective mappings 6 and (p. Then 

• the stabilizer subgroup Stab^ (x) is a subgroup o/Stabc (^(a;)) := ip^^^^tahu 
and 

• we can define a canonization map CanG(x) in the following way: 

(1) Compute CanH{e{x)) = (CFh(^(x)), TRh(^(x)), Stab^ (^(a;))) for some 
fixed canonization map Caui;;-. 

(2) Compute g E ip-\TRH{9{x))) andG' := Stabc {g9{x)) = gStahc (^(x)) g-\ 

(3) Define Candx) := {CF g' (gx), TRc (gx) g, g'^Stahc (gx) g) for some fixed 
canonization map Cane- 

Example 6. Let G = (V, E) be a graph with finite vertex set V := {1, . . . ,n} 
and edges E C {{x, y} \ x,y E V : x ^ y}. There is a natural action of n E Sn 
on G defined in the following way: 

ttG := {V, nE) := {V, {{7r{x),7r{y)} \ {x, y} E E}). 

If {x, y} E E we say that y is a neighbor of x. Now, let N{x) count the number 
of neighbors of x and define the S'„-homomorphism N{G) := (A^(l), . . • , N{n)). 
The Homomorphism Principle tells us 

(1) to canonize this sequence under the action of the symmetric group Sn, 
for instance by sorting the sequence lexicographically. 

(2) If TT E Sn is the corresponding permutation, we have to relabel the graph 
via the application of vr and 

(3) canonize the relabeled graph under the stabilizer Stab^^ (7rA^(G)). 

We may interpret the result of N{7rG) as some coloring on the vertices. In the 
following this coloring has to be preserved by the group action. This allows 
us to apply the Homomorphism Principle recursively since in the following we 
can count neighbors of a single color class as well. 
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2.3 The lifting approach 



Let if be a subgroup of G, short: H < G. A subset T C G is called a right 
(left) transversal of H in G if it is a minimal but complete set of right (left) 
coset representatives, i.e. Ht ^ Ht' for all t,t' eT and G = UteT 

Proposition 7. Let G be a group acting on a totally ordered set X . Suppose 
that there is already some canonization Cann available for H < G and let T 
be a right transversal of H in G. Then, we can define the canonization map 
Cane for x E X in the following way: 

• CFg{x) := mint^rCF H{tx). 

• Let ti E T be a transversal element with CFg(x) = CFnitix). Define 
TRg(x) ■.= TRH{tix)t,. 

• Let ti,...,tm € T be those elements of T which define a canonical form 
CFnitix) = CFcix). The stabilizer Stabc (x) is generated by {titi^ \ i = 
2, . . . , m} and Stab^ (x) . 

Example 8. Like in the example above, let G = {V, E) be a graph with n 
vertices and let H := Stab^^ (1) be the stabilizer of 1 G V. Then, we may define 
the canonization of G under the action of Sn by comparing the canonical forms 
under the action of H for the n graphs derived by interchanging the vertices 
1 and i, i = 1, . . . ,n. 

For example, we may apply this approach if the number of neighbors is con- 
stant on G. Then, the separation of 1 E V allows us to color the vertex 1 
differently from all others and to count neighbors by colors again. This may 
result in different values and would allow us to define the canonization under 
H with the help of the Homomorphism Principle. 

2.4 Partitions and Refinements 

As we have seen in Example [8] it makes sense to combine the methods of Sub- 
sections 12.21 and 12.31 The basic idea is to alternate between both methods and 
is known as the partition refinement method: CanG'(a;) is recursively computed 
via 

(1) the application of the Homomorphism Principle for a well-defined se- 
quence of homomorphisms of group actions which may lead to a smaller 
stabilizer G' and the element x' = gx. 

(2) If the group G' is not trivial, we apply the lifting approach for a well- 
defined subgroup H < G' and recursively continue the computation 
of CdsiH{tx') for t G T in a similar way. Otherwise, we just return 
{.x'Ma, {idc}). 
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In this formulation, the different canonization processes Can//(tx') for the 
right transversal elements t G T in the lifting approach are carried out inde- 
pendently. Of course, making use of some global information in this processes 
could further reduce the computational complexity. The partition refinement 
method also considers this problem as we will see later. For this reason, we 
will replace the above formulation by a backtracking approach. 

Partition refinement methods are widely used in the canonization of combina- 
torial objects, for equivalence tests and automorphism group computations, 
for instance [HS|^ ?JTU]. In most cases the authors restrict themselves to the 
action of the symmetric group or some special subgroups. The formulation 
above shows that the ideas presented there are also applicable for arbitrary 
groups. 

Nevertheless, we will similarly formulate our algorithm only for the action of 
the symmetric group. The main reason for this restriction is an easier descrip- 
tion of the algorithm and some observations which we can only give in this 
special case. We will later see, that there is an action of the symmetric group 
in our problem, too. 

A partition of [n] :={!,..., n} is a set p = {Pi, . . . , P/} of disjoint nonempty 
subsets of [n] whose union is equal to [n] . We call the subsets P G p cells of the 
partition. Cells of cardinality 1 are singletons and the partition p is discrete 
if all its cells are singletons. If all cells of a partition p are intervals we call p a 
standard partition. In the following we will always use upper-case letters for 
standard partitions. The stabilizer 



S, := Stab5„ (p) := f] Stab5„ (P) 
Pep 

of the (standard) partition p is a (standard) Young subgroup of Sn- With 
Fixed (p) := {i G [n] \ {i} G p} we refer to those indices which define singletons 
of p, i.e. fixed points under the group action of Sp. 

The partition p is finer than the partition p' if each cell P G p is a subset 
of some cell of p'. We also call p a refinement of p' and say that p' is coarser 
than p. 

Differently to (Hl^Hn] our approach only uses standard partitions where the 
ordering of the cells is naturally defined by the elements they contain. This 
difference is due to the fact that we maintain a coset S'vpvr of a standard Young 
subgroup, which is the key data structure in all algorithms, by the pair vr) 
instead. 
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Fig. 1. Partition refinement backtrack tree 
2.4-1 Backtrack tree 

Suppose there is the group action of a standard Young subgroup Stpf, on a set 
X. For an element x E X we can compute its unique canonical form and its 
stabilizer using a backtrack procedure on the following search tree, see also 
Figure [1} 

• The root node of the search tree is (*Po) ids„) and we will apply a refinement 
on it as described below. 

• The nodes tt) where *P is discrete define leaves of the tree. 

• Otherwise, i.e. in the case that *P is not discrete, we perform an individualization- 
refinement step: 

■ Choose a well-definecQ cell P G *P which is not a singleton, called the 
target cell and use the lifting approach for S^i := Stab 5^ (m) < 5'<p where 
m = min(P): The refinement of *P is derived by separating the minimal 
element m G P, i.e. replace P by {m} and P \ {m}. If T is a right 
transversal of S'lp' in S'<p, the \T\ = |P| different children of the actual 
node are constructed by applying the permutations t E T. 

■ A refinement of for the node (^', tvr) could be computed via the appli- 
cation of the Homomorphism Principle using a fixed S'fp'-homomorphism 
/rp/ : X — )■ y. We choose the action and the canonical forms in Y in such 
a way that their stabilizers are again standard Young subgroups. 

Let a := TRs^, (/cp/(t7ra;)) and S'jh := Stab^^, (/sp'((jt7rx)) be the result 
of the canonization of /fp'(t7ra;). The principle tells us that we have to 
canonize the element atnx under the action of the group S'<h which is 
based again on further individualization-refinement steps. 

Traversing this tree in a depth-first search manner corresponds to the afore- 
mentioned alternating application of the Homomorphism Principle and the 
lifting approach. We are able to define the canonical representative Can^^ (ttx) 

^ The target cell selection is an Sqj-invariant. 
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if we have visited all children of the node tt). So far, the canonization un- 
der the action of S'(p/ are still independent processes for different t E T. The 
following definition of a total ordering <(p on X allows us to change this: 

a; <<p 1/ /<p'(a;) <y f<^'{y) V {f^'{x) = f^>{y) A x <x y) 

where <x and <y are still some arbitrary total orderings on X and Y respec- 
tively. 

This ordering will be used in the lifting approach for the definition of the 
minimum and it allows us to prune the search tree, i.e. skip the canonization 
Can^j^ (t27rx) in the following situation: If the canonical form <Jifs;pi{tiTTx) is 
smaller than the canonical form (72f^i{t2'n:x) in the Homomorphism Principle 
for two nodes arising in an individualization step, we prune the subtree rooted 



Remcirk 9. For the sake of simplicity, we did not use homomorphisms of 
group actions in the formulation of the refinement step and we restricted the 
formulations to Ssp' -homomorphisms /qj/. The function /tp/ itself might be a 
concatenation of several functions which allow a successive application of the 
Homomorphism Principle. In this case, we adapt the ordering <(p such that 
we may prune the tree in some intermediate step as well. 

In the case that = we may recursively use x <x y :<^=^ x <fH y, 

where 5*$^ := Stabg^, (x), to compare x and y. Only in the case that 91 is 
discrete, i.e. the corresponding node is a leaf, we use some fixed ordering on 
X. This also shows that we are not only allowed to compare the children of 
a fixed node among each other. In fact, we can prune a node (^',7r) of the 
search tree if there is a another node {^', a) on the same level having the 
same values for all 5'Q-homomorphism J'q applied from the root down to these 
nodes and whose actual image /fp/(7ra;) is larger than 

Theorem 10. Let D denote the discrete partition of [n]. Suppose that (S),7r) 
is the last visited leaf of this pruned search tree. The mapping 



CajiG{x) :— {TrXjir, jTr ^tTj | i e [a]}) 



defines a canonization, where tti, . . . are those permutations leading to all 
other leaf nodes {D, iTi) with TTiX equal to ttx. 

Proof. Let F := (/%,•••, /cp^) be the sequence of ^tp. -homomorphism, i — 
0, . . . , r applied in the generation process to the leaf node (3), tt). The sequence 
F defines a total ordering on X: 



in {^',t2n). 




X <Fy ■ 
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The pruning ensures that all other leaves [T) , a) of this search tree will lead 
to orbit elements ax with irx <f ax. This shows that all leaves which were 
not visited correspond to orbit elements that compare strictly larger than nx. 
Therefore, nx is the minimal orbit representative of S<:^qX under <p. It is 
not difficult to prove that starting this backtracking algorithm for some other 
element x' G Stp^x will lead to the same orbit representative tc'x' = ttx. □ 

Obviously, the depth-first-search strategy only allows the pruning of a subtree 
based on some partial information. In particular, we have to explore subtrees 
which will later be discarded. A breadth-first-search strategy would avoid this 
behavior. Nevertheless, there are more advantages of a depth-first search ap- 
proach: First of all, there are no storage limitations since we only have to 
maintain the path from the root node to the actual node. Furthermore, it 
is possible to discover automorphisms of the object x, since those can only 
be computed by the comparison of leaf nodes. The group of known automor- 
phisms of X allows us to perform a further pruning of the search tree. For this 
goal, we use the methods described in |H Section 5.2]: We store the subgroup 
of already known automorphisms yl < 5*^ by a complete labeled branching, 
pi, Lemma 5.9] now gives a simple criterion if the coset S'lpVT, i.e. the subtree 
below the node vr), has to be traversed or not. 

Finally, we would like to mention that [12] discusses a mixture of both strate- 
gies in the computation of a canonical form of a graph. We think that this 
approach might be applicable in our case as well, but we are not yet sure about 
all the consequences because we have to incorporate a second group action on 
X at the same time, as we will see in the following section. 



3 A reformulation of the problem 

The projective space Pg{k) is the set of all subspaces of F^. As usual, we 
call the one-dimensional subspaces points and the {k — l)-dimensional sub- 
spaces hyperplanes. Let Aut(Fg) denote the automorphism group of Fg. Recall 
that any automorphism a G Aut(F5) is a power of the Frobenius auto- 
morphism T : X ^ x^. It applies to vectors and matrices element-wise. The 
set of all semilinear mappings, i.e. the general semilinear group rLfc(g) := 
GLfe(g) X Aut(Fg), decomposes as a semidirect product with multiplication 
(A,r»)(5,r'') := (A r"(5), r'^+*). 

Remark 11. Let A^, H be arbitrary groups and ip : H ^ Aut(A^) be a homo- 
morphism between the group H and the automorphism group of A^. Although 
the multiplication of elements (ni, /ii), (n2, /i2) E N H depends on the 
choice of (p, i.e. (rii, hi){n2, := {niip{hi){n2)^ hih2), we will not give the ex- 
act definition of the homomorphism ip when introducing semidirect products 
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of groups in the following. We believe that the right choice of if can always be 
observed from the context. 

There is a natural action of rLfc(g) on the projective space ^q{k) from the 
left, i.e. 



rLfe(g)xPg(fc)^P,(A;) 
((Ar"),W)^Ar"(W). 

Since this action is not faithful, one may also factor out the kernel resulting 
in the action of PrLjt(g) := rLjt(g)/F* on Pg(A;), where F* denotes the multi- 
plicative group of Fg. We use both groups and both actions interchangeably. 

The goal of this paper is the description of a practical canonization algorithm 
for a given sequence of (multi-) sets C = (Ci, . . . , Cm), with Ci C Vq{k) under 
the action of rLfc(g). The stabilizer subgroup 



Aut(C) := Stabru,{,) (C) := {^{{A,t^) G rL,(g) | V W G : {A,r'')U G C,,} 

is computed by the algorithm at the same time without any additional effort. 
We apply this algorithm to solve canonization problems in coding theory, see 
Section il 

The remaining part of this section deals with further modifications of the given 
sequence C we could make: 

• We may assume that the multisets Ci are in fact disjoint subsets. Oth- 
erwise, we could distinguish the occurring subspaces by their sequence of 
multiplicities, which leads to a sequence of disjoint subsets C . This defines 
an FLfc-homomorphism and the stabilizer of C acts trivially on C. 

• The action of rLfc(g) preserves the dimension of any U G Pg(A;). Hence, 
asking for a canonization algorithm for a set Ci is equivalent to ask for a 
canonization of the sequence {{U G Ci \ dim(W) = s})<i=o,...,fc. Therefore, we 
can assume that all elements of a subset Ci have fixed dimension < Sj < fc. 

• If some subset Ci is empty or equal to {U G ^q{k) \ dim(f/) = s}, i.e. the 
subset of all s-dimensional subspaces of F^, for some s = 0, . . . ,k, we can 
remove Ci from the sequence since the action of rLk{q) on this subset is 
trivial. Therefore we could suppose that 1 < Si < k — 1. 

• The union Ui^i Ci spans the whole space, otherwise we would be able to 
solve the problem in a smaller ambient space Ff < k. 

In the following, we suppose that the sequence C = (Ci, . . . ,Cm) and therefore 
also the parameters q, k, m, rii, Si, n = J^iLi rii will be fixed. 
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Proposition 12. For a given subspace U 6 Pg(/c) let U-^ := {f G | v'^u = 
0} be its dual subspace. The dual subspace of {A,t°-)U for (A, r") G TLk{q) is 
equal to {A~^'^ , a)l{-^ . 

Proof Let U G Pg{k), {A,t'') G TLk{q) and u e U,v eU^ he arbitrary. The 
equation 



{{A'^^,r'')vf{A,T'')u = T^vfA-^Ar'^iu) = t^v^u) = r'^(O) = 

sliows tliat {A~^'^ ,T°')U-^ C ((y4, r")W)"'". But botli subspaces liave dimension 
n — dim(W) and hence must be equal. □ 

Remark 13. If we define C^- := {U^ \ U e Ci] and := {Ci , . . . ,Ci) then 
we may also canonize C"*", i.e. compute CanrLfc(g)(C"'"), and define the canonical 

formCFrL,(,)(C) := (CF rLfc(q)(C"^)) . The automorphism group of C is equal 
to 



[A-^^.t'') I (At") G Aut(C^)}. 

This transformation will always be applied if we suppose that the computa- 
tional effort of computing CanrLfe(g)(C-'-) is less expensive than the computation 
of CanrL,(g)(C). 

Let F^^"'** denote the set of A; x n-matrices of rank s. The algorithm we are 
going to develop is a generalization of the canonization algorithm for linear 



codes, see [IfS] and Section 5.1 for a short summary. Instead of working on 



linear codes directly, the problem is transferred to generator matrices of linear 
codes, i.e. matrices whose rows form an F^-basis of the linear code. Two ma- 
trices r, r' G F^^"'^ generate equivalent codes, if their orbits under the group 
action of (GLfc(g) x F*") x {Sn x Aut(Fq)) are the same. It is a well-known 
fact [2, 9.1.2] that there is a one-to-one correspondence of these orbits and the 
orbits of rLfc(g) on multisets of at most n points in the projective space, which 
span a vector space of dimension k. Therefore, the canonization algorithm for 
linear codes already solves the canonization problem for any multiset of points. 
A closer look reveals that the algorithm similarly transfers the multiset to a 
sequence of disjoint sets of points. 

Now, represent the element W G by some matrix U G F^^*'''^* whose columns 
generate U. The set of all matrices generating U in this regard is equal to the 
orbit Ghs^{q)U := {UA^ \ A G GL.,. (g)}. Analogously, we can identify the set 
Ci := {Wi, . . . with the orbit of on (Wi, . . . In summary, there 

is a natural one-to-one correspondence between the set Ci and the orbit of 
(f/i, . . . , f/„.) under the action of GL^. (g)"* x S'„.. This semidirect product is 
equal to the wreath product GL^ . (g) I Srn ■ 
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In the case that Sj = 1 we know that GLi(g) = F* and the group GLi(g)"' x 
Srn = F* I Sm is isomorphic to the group of F*-monomial matrices, i.e. the set 
of permutation matrices whose nonzero entries got replaced by elements from 
F*. In this regard, we can view the wreath product GL^. (g) 1 3^ as the group of 
GLs-(g)-monomial matrices by replacing the nonzero entries of a permutation 
matrix by arbitrary elements from GL^- (g) and the zero entries by (sj x Si)-zero 
matrices. 

Altogether, we can identify the sequence C with the orbit of 



under the action of nili(GLs-(g)"' x S'„.) which could be interpreted as the 
group of block diagonal matrices whose m nonzero blocks are equal to GL^- (g)- 
monomial matrices. 

Finally, taking the action of rLfc(g) into account we have to canonize the 
sequence ({Ui \ . . . , Uj^})^ .^^ ^ under the action of 



^GUiq) X lliGUAqr X Snjj X Aut(F,), 
where the action is defined as follows: 

Let [A, (b[^\ . . . , B^^\ ir'^^A , r'^ ) be an element of the acting group then 



^-°(f:'.i--.,„)«!"'.-.^-"(f':;!--.,„,)^a'').,„. 



In order to apply the methods developed in the Subsection 2.4, we change the 
order in which we compose the group: 



GL,,(g) X n GL,^(g)"> x Aut(F,) x J] ^n. 
\ i=\ ) ) i=\ 

GL,(g) X n ^UXqT X Aut(Fg) x 



and replace the permutational part of this group using the standard Young 
subgroup S'fpg to the partition *Po '■= {{l? . . . , ni}, . . . , {n — + !,•••, of 
[n]. 
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A final reformulation of our problem will be given in Section [5] since we would 
like to motivate the algorithm with the observations given in It will show 
that we could observe a homomorphic group action of the symmetric group 



S'fpf,. This allows us to apply the ideas developed in Subsection 2.4 



As we have seen above, the comparison of the objects we are working with 
plays a central role in the canonization process. In our case, if nothing else is 
stated, we will suppose that ¥q is totally ordered such that < 1 < /i for all 
G F*. Then we can totally order the set of fcxn-matrices by interpreting them 
as lexicographically ordered sequences of colexicographically ordered column 
vectors. 

Furthermore, we will access submatrices of a matrix U G F^^" in the following 
way: 

• f/^K^j denotes the i-th column of U. Similarly, we write Ui^^, for the i-th row. 

• For a sequence / := {ii, . . . , im) of indices ij G [n] we write U^j := (f/*,ii, . . . , 
for the projection of the matrix onto the columns given by /. We also use 
this notation for the set I := {^i, . . . ,im} which should be interpreted as 
the lexicographically ordered sequence of its elements. 

• Finally, if J is a sequence of indices in [k], then Uj^^ denotes a similar access 
to the rows of U and Ujj := (f/j,*)*,/. 



4 Coding theory 



Let {Mi,di) and {M2,d2) be two metric spaces. A map l : Mi — )■ M2 is an 
isometry if it respects distances, i.e. d2{L{x) , L{y)) = di{x,y) for all x,y E Mi. 



4-1 Random network codes 

The subspace distance is a metric on the projective space Pq{k) given by 

ds{U, V) := dim(W + V) - dim(W n V) = dim(W) + dim(V) - 2 dim(W n V) 

for any U,V E Pq{k). It is a suitable distance for coding over the operator 
channel using so-called random network codes C C Pq{k), see [7j. 

Obviously, the action of an element of the general semilinear group preserves 
the subspace distance. On the other hand, [13] showed that PTLk{q) is iso- 
morphic to the group of isometries on Pq{k) which preserve the dimension of 
each element in Pq{k). The dimension is another basic property of a codeword 
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which should be preserved, too. Therefore, it makes sense to define equiva- 
lence of random network codes by means of this group action. It shows that 
the canonization of random network codes is a special case of our algorithm 
for sequences of length one. 



4-2 Additive codes 

An ¥q-linear block code over the alphabet ¥ga,s > 1 is an F^-linear subset 
of ¥gs equipped with the usual Hamming distance (inam- Additive codes with 
s = 1 are classical linear codes. For s > 1, those codes are sometimes also 
called additive codes. They gained more and more interest in the past years 
since for example self-orthogonal additive codes over Fg2 could be used for 
quantum error- correct ion, see [Ij. 

With an Fg-linear representation of the elements of F^s in F*, the Fg-linear code 
can be represented by a generator matrix with entries in Fg. Let T : Fgs — )■ F^ 
denote the corresponding F^-linear mapping. Defining the distance 



II, else 

and extending this definition as usual to F^" we are able to find all isometrics 
on mapping F^-linear codes onto Fg-linear codes: 

• The multiplication of F* by an invertible matrix A G GLs{q) defines an 
Fg-linear isometry. 

• The same holds for the permutation of the n components of F*". 

• The element- wise application of an automorphism of ¥g defines an isometry 
on F*", which maps Fg-linear codes onto F^-linear codes. 

Altogether, this defines a group action of GLs(g)" x {Sn x Aut(Fq)) on the set 
of Fq-linear subsets of F^^. Since isometrics are injective, we could also restrict 
this action to act on subsets C with dimp (C) = k. Each such subset C could 
be represented by a generator matrix F G F^^*"''^. The set of all generator 
matrices of C is equal to the orbit GLfc(g)F. Hence, we are interested in the 
canonization of a generator matrix under the group action of 

(GLfc(g) X GL,(g)") x {S„. x Aut(F,)). 

Since every s consecutive columns may define the same subspace, the code 
could be identified with a multiset of subspaces of F^. 
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5 The Algorithm 



In this section we develop a practical algorithm which computes the automor- 
phism group and a canonical form of a given sequence C = {Ci, . . . ,Cm)- In 
Section [2] we have seen why it is useful to combine the searches for both tasks. 



5. 1 The algorithm for linear codes revisited 



First of all, we want to motivate our procedure by a reformulation of the 
canonization algorithm in |3] for linear codes using the language developed in 
Section [2] Some of the ideas we are going to introduce are based on which 
gives a more detailed description of the backtracking approach. The algorithm 
that we are going to develop in the subsequent subsections can be seen as a 
natural generalization of the one for linear codes. 



We first observe that we could compute a canonical form of a fc-dimensional 
linear code C with generator matrix F e ip^x^.'^ using the ideas presented in 

Section 2.4: For simphcity, let G^*^') := (GLfc(g) x F*") x Aut(Fq). The group 

acts on the set of orbits G^'^^WF^^"''^ := {G^^'^F' | F' e F^^"''^} and we can 
define a homomorphism of group actions 



Q . jpfcxn,fc 



^ G(^i)\\F: 

F ^ G(^i)F 



'kxn,k 



if : G(^i) X S<^, ^ S<^, 



Before we provide the details of the canonization Can^,^^ (G^'^^^F) we explain 
how to define CanQ(si)^^^ using the Homomorphism Principle: First of all, 
in [1] it is observed that there is a direct and efficient canonization algorithm 
CanG(.i) for the action of G^^^^) on Fj^". Let vr = TRc.^^ (G^^^^F) and G^*^') x i7 = 

StabQ{si)>^5^,^^ (^ttG^^^^F^ be the result of the canonization. Since we know that 

(G^*^') X if) ttF = G^^i^vrF it remains to define CFG(=i)>,H(7rF) := CFG(si)(7rF) 
and the transporter TRQ(si)><,j|^(7rF) := (TRq(si) (vrF), id). There is also a simple 
way to compute the automorphism group 

StabQ{si)^ji^ (F) using the canonization under the action of G*^'^^-'. The details are 
left to the reader. We will later see that all necessary data is already computed 
in the computation of Can^ (G'^'^^^F). 
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5.1.1 Backtrack search 



This shows that we are able to give a practical canonization algorithm for 
linear codes if we are able to give a practical algorithm for the computation 
of Can5,^,^^ (G'^'^'-'r). This algorithm will be based on the partition refinement 
idea. In this algorithm, it is necessary to compare the leaves of the backtrack 
search tree. Therefore, we have to define a total ordering on G*^'^^''\Fg^"''^: Let 
CanQ{si) be a canonization algorithm for the action of C-*^'-* on the set F^^"'^, 
then we may order the orbits via the ordering on their canonical forms. 

It remains to give the S'm-homomorphisms which will be applied to a node 

y 

(*P,7r). The first is closely related to CanQ(si). Let Fixed vr) be the se- 
quence of elements of Fixed in the order they appeared as singletons in 
the refinement process *Po > • • • > ^ leading to this node (*^3,vr). We say 
that a matrix F^^''^^ is a semicanonical representative of the node (^, vr) if 
r(*.-) G G^^'^ttF and 



fF^'P'")) , < (F') =^,,^ , for all F' G ttG^^'^F = G^^'VF. 

Proposition 14. The projection -kG'^^^^V ^ [V^^'^^] j. is Sm-invariant. 

This invariant is applied immediately after each individualization step and 
after each refinement which leads to a new singleton in the partition Since 
it is an invariant, it will not refine the partition But, it will give us the 
possibility to prune the search tree. 

For a child (fH, air) of vr) the semicanonical representative could be easily 
computed from the semicanonical representative of vr). For this computa- 
tion, we only need to know the stabilizer /nn^'P'^) < G^'^') of ( T^'^'^A , 

V / *,Fixed(*p,7r) 

where the action is defined by 



(A,6,r")r := (A, (&,),.,^(^,^),r")r for all F' G Ff I^^^^^P)!. 
For more details see [Ij. 

Therefore, it makes sense to add this data to the nodes of the backtrack tree. 



We modify the nodes tt) of Section 2.4, such that the algorithm addition- 
ally maintains the orbit G^'^'-'ttF by the pair (r(V.^), /nn^'P''^)). We will call the 
action by Inn^'^''^^ the inner group action and Inn'^^'^^ the inner stabilizer. The 
computation of the semicanonical representative of vr) will also be called 
the inner minimization process. 

Remark 15. The computation of a semicanonical representative itself could 
be seen as an application of the Homomorphism Principle applied to the G*^**^-'- 
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homomorphism T k-)- T^^^t^^^^^. If ^ is discrete the semicanonical represen- 
tative defines a canonical form of the orbit G'^'^^^TrF. 

Remark 16. There is also a second interpretation of the subtree below some 
node 77, r^'P''^\ Inn^^'^^). It could be identified as the canonization of T^'^''^^ 
under the action of Inn^'^''^^ x Ssp. For the root node, the group 
is equal to G^*^'^ x Sfp^, i.e. the action we are actually interested in. We have 
motivated this backtracking with the canonization under S'fp^, since 

• we already proved its correctness in Section [274 

• the test on the group of known automorphisms is restricted to standard 
Young subgroups, and 

• the homomorphism of group actions we are going to apply should have this 
special structure, i.e. they will be either S^p-homomorphisms or equal to 
the inner minimization process. This will avoid the occurrence of complex 
subgroups of G*^*^'^ x S'sp^ for which it would be difficult to define appropriate 
homomorphisms of group actions in the refinement steps. Furthermore, the 
computation of the transversal T in an individualization step would become 
more complicated, too. 

Apart from some further Ssp-homomorphisms which make use of (F*^*"'^-', Inn^'^'^^), 
there is another very important Sfp-homomorphism used to derive further re- 
finements. In particular, this S'fp-homomorphism works already very well on 
nodes on the first levels of the backtrack tree. We are going to generalize it in 
the following. Again, we will give a more general description of this function 
than given in p|5]. The basic idea is a modification of Leon's algorithm, j9], 
for the computation of the automorphism group of a linear code: 



5.1.2 Leon's invariant set of codewords 

Suppose that W := {c^^\ . . . , c^^'^~^^'^^ C C is a set of codewords, which is 
invariant under the automorphism group of C. In fact, if one wants to apply 
Leon's algorithm, [9], to test two linear codes for equivalence the mapping 
C ^ W has to be an (F*)" x {Sn x Aut(Fq))-homomorphism. For simplicity 
suppose that W is formed by all words of minimal nonzero weight d. This 
set is once computed for the root node and fixed for the whole backtracking 
algorithm. 

For each codeword c^^'^ there is a well-defined information word f G 
such that f'^-'^F = c^^\ Since W is closed under scalar multiplication, we may 
restrict ourselves to the set W of projective representatives and define by 
1-L := {f '-•'^ I j = 1, ...,/;.}:= {f G Fg I t>F G W} the corresponding set of 
information words. 

The standard inner product of (t>*^-'\ F^^j) = v'^^^'^V^.^i is equal to cf \ the i-th. 
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coordinate of the vector c^^\ Therefore, the set Ji is the well defined subset of 
all normal vectors of those hyperplanes containing exactly n — d points (F* j). 

This gives a bipartite, vertex-colored subgraph of the subspace lattice of the 
projective space, whose vertices are labeled by [n] and {n+1, . . . , n+h} respec- 
tively. Initially, the colors only distinguish vertices by dimension and in the 
case of points additionally by the cell they are contained in. Since the action 
by rLfc(g) obviously preserves the graph structure, this graph is independent 
from the actual representative of G'^'^'-T. Furthermore, it is well-defined up to 
the action of Sh on the vertices {n + 1, . . . ,n + h}. The permutation of the 
columns of the generator matrix results in a relabeling of the vertex set [n] . 

Cell-wise counting of neighbors for each vertex allows us to define an S'lp^ x Sh- 
homomorphism and hence to apply the Homomorphism Principle to refine the 
partition (coloring) of [n + /i]. Since the projection on the first n components 
also obviously defines an SspQ-homomorphism, this could be also seen as a 
refinement of the root node of the backtrack search tree. The resulting permu- 
tation in the application of the Homomorphism Principle gives a simultaneous 
relabeling of the graph and a permutation of the columns of the generator ma- 
trix. 

This homomorphism on the incidence graph is also used in the computation 
of a canonical form of an arbitrary graph pTO]. We furthermore observe that 
the finer partitions DIq of [n] and £2o of [h] allow us to call this invariant 
iteratively. Hence, the nodes of the backtrack tree additionally maintain a 
partition £3 of the set [h]. Furthermore, instead of storing the relabeled graph, 
we just maintain a second permutation a & Sh which stores the relabeling on 
the vertices {n + 1, . . . ,n + h}. Any refinement on the partition *P during the 
backtrack search gives us the possibility to restart this refinement process on 
the relabeled, newly colored incidence graph again. 



5.2 Preprocessing and the backtrack tree 

In the same manner, we start the algorithm for the sequence C by some pre- 
processing routine, which is in fact an FLfc(g)-homomorphism computing a 
set H of hyperplanes: For each hyperplane (v) G Pq{k) we may compute 
the number of elements it contains from each cell of *Po- This results in a 
unique partition of the set of all hyperplanes. We choose a well-defined subset 
n:=UT=i^i, which is a union of the blocks of this partition in such a way 
that it contains k linearly independent normal vectors. This set should also 
be reasonably small. 

Example 17. Let C := {UiMiM-i} C P3(4) with Ui generated by 
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1 0^ 



^0 10^ 

The hyperplanes 7/1 := <j ( (0, 1, 0, 0)'' )" , ( (0, 0, 0, 1)^' )"1 contain exactly 2 el- 



ements of C, whereas the elements (^(0, 1, 0, z/)^y , ^(yU, 1, 0, 0)^y and (^{0, 0, /i, 1) 
for z/ G Fg,// G F3 contain a single element of C. They form the set 'H2- The 
remaining hyperplanes contain none of the elements from C. We choose the 
set "H = "Hi U since it is the smallest set spanning the whole vector space. 

In difference to the algorithm for linear codes described above, we do not only 
use this set as a tool for the refinement of a node. In fact, we append the 
sequence (J-Li, . . . , T-Lm') to C and it plays an active role in the construction of 
the backtrack search tree, as we will see later. We are allowed to do so because 
of the following proposition: 

Proposition 18. Let G be a group which acts on X and f : X ^ Y a G- 

homomorphism. Suppose there is a canonization algorithm CanG((x, /(x))) for 
the action of G on {{x,y) | x G X, ?/ G Y}. Then, this defines a canonization 
algorithm on X via 



CanG(x) := (CFG((a;, /(a;)))i, TRG((a:, /(x))), Stabc ((x, /(x)))) . 

One reason for this decision is the fact that the partition £} of [h] ,h := {T-Ll 
allows us to perform the individualization step on a cell with smaller cardi- 
nality and hence results in a smaller branching factor, which we see as an 
advantage. On the other hand, we realized that it is much more difficult to 
give an efficient canonization algorithm for the action of 



(^GLfc(g) X JlGLsAqrj x Aut(F,) 

on the sequence of subspaces whose coordinates are fixed by S'lp. 

Let *P be a refinement of {{1, . . . , n}, {n + 1, . . . , n + h}}. In the following 
we use to refer to the corresponding partition of [n] and to refer to 
the partition of the set [h] arising from the partition of the last h coordinates 
given by 

Instead of representing the hyperplane H El-L using a matrix U G Jp^xC^-i).^-! 
we use its dual space which could be represented by a single vector t> G Fj - 

a normal vector of H. Since {A^a)H^ = ({A~^'^,a)v\ , see Proposition 



we have to keep in mind that the action is differently defined on them. 



12 



19 



Altogether, we are going to develop a canonization algorithm for the action of 
G("i) X S<p^ on n^i(Ff X (^g)^ where 

G(^') := (cUiq) X flGUXqr x Ff j x Aut(F,) 

and is the partition of [n + /i] given by the different subsets. The action of 
G*^*^^^ on a vector {U, V) is defined in the following way: 



(v4, {Bi)i^[n], {bj)j(^[h],T'') ((f/i)i6[„], {Vj)j(.[h]) 




Before we are going to describe the rules for building up the backtrack tree, 
in particular how to define an analogue inner minimization procedure and 
suitable Ssp-homomorphisms, we shortly summarize which information should 
be contained in each node: 



A permutation vr G Sn+h and a standard partition *p of [n + h] describing 



the state of the backtrack tree analogously to Section 2.4 

A vector ([/(^s.^)^ G (llLi F^"""^'^'"^*^) x (F^)'', storing the semicano- 

nical representative of this node. 

A subgroup /nn^'P'^) < (GL„(g) x JlLi GL,(i)(g) x (F^^) x Aut(Fq) which 
stores the stabilizer under the inner group action. 



Similar to Remark[T6} we can interpret this backtracking as an algorithm which 
computes Can^^^ [G^^^\U,V)^ or as the canonization Ca.nQ(si)^s^^{U,V). Al- 
together, this solves our initial canonization problem for sequences of subsets 
in the projective space. 

Example 19 (Example [l7| continued) . We can choose 
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111 
00021012 
1110 
12 110 





1 

1 



as a semicanonical representative of the root node (*Po;id) with the initial 
partition := {{1, 2, 3}, {4, . . . , 11}, {12, 13}}. 

In the above representation of ([/('Po'^'^), y('Po,id)'j already included the par- 
tition dashed lines mark the end of the matrices uj;'^°'^'^\ whereas solid 
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lines mark the end of a cell of the partition. The double horizontal line shows 
the change from t/^^"'^'^) to 

Before starting the backtracking, we refine *Po based on the incidence graph 



like in the case of linear codes, see also Subsection |5 . 4 . 2 1 f or a detailed descrip- 
tion in our case: 



• We observe that is different from Ui,U2 since it is contained in both 
hyperplanes in "Hi whereas the other two elements are only contained in a 
single hyperplane. 

• The hyperplanes which contain U3 can be distinguished from all others. 
This leads to the following refinement of the root node: 
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Olj 



where we also applied the permutation (2,3)(7, 10)(8, 11) G S'fp„. 

In the following subsections we discuss the generalization of the inner mini- 
mization process and the definition of suitable Srp-homomorphisms. In Sub- 



section 5^ at the end of this section the whole backtracking procedure will be 
summarized. 



5.3 Inner Minimization 



One main observation of [4J is that (in the computation of a canonical form of 
a linear code) the GLfe(g)-component of the stabilizer Inn'^^'^^ could be easily 
stored by a pair (t, p) where < t < A; and p is a partition of [t], i.e. it is equal 
to 



D G F*^* diagonal matrix and constant on all P G p. 
Pi GFf P2eGL,_i(g) 

In the case that p is the discrete partition we simply denote this group by 
G\}-l\q) := GL^*'^^^^''"'^*^^''(g). We will make a similar observation in this work, 
but since the result is achieved by the action on the normal vectors of the 
hyperplanes we have to use the transposed group 
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instead. This group has furthermore the following nice property: 

Proposition 20. The multiplication of some matrix A G GL^*'''^(g) from the 
left stabilizes the first t rows of a matrix U G F^^^ up to scalars. 

In the following, let s{j) denote the number of columns of the matrix Uj, j G 
[n].We describe the inner minimization process which is always applied after 
the partition *p has been refined to 91 combined with the application of some 
permutation a G S<^, i.e. after each individualization step and any successful 
refinement. The semicanonical representative of (9^, an) is the one which is 
derived from the following sequence of applications of the Homomorphism 
Principle: 

• Let n=— AU,V) := V i^— ^.^j ^ and 



^Fl^mn) ^^^^^^"^ X f;I™(^«)I) X Aut(F,) 

The pair (n^^^,^^, v?^(g^^)) defines a homomorphism of group actions. 
We will later prove that the GLfc(g)-component of the stabilizer in the previ- 
ous step is a subgroup of GL^*^ (q) . In particular, we can use the parameter 
t in the following definition: 

Let (U,V) := ((Ui),,.) , and 

Fixed(«c) V *^M'*/jGFWl(lKc) 



^iLkiv^e) ^^"^^"""^ ^ ^ JJ GL.(,)(g) X Aut(F,) 

\ ieFixc<l(!Kc) / 

This defines a homomorphism of group actions. 

We also apply a third homomorphism of group actions which restricts the 
components GLs^i){q) for i E [n] \ Fixed (fHc)- K is only called in special 
cases and therefore we will state this at some more appropriate place, see 
Subsection 15.3.2.31 



5.3.1 The structure of the inner stabilizer and semicanonical representatives 

We start with a description of the inner stabilizer Inn^'^''^^ which is computed 
in each step: For simplicity we just write Inn and {U, V) in the following. 
The description of the computation of a semicanonical representative is given 
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afterward. It is based on a recursive method and the full details are stated in 
Subsection 15.3.21 

After the inner minimization process, the group Inn can be expressed by the 
parameters 

• (t,p) - describing the multiplication from the left 

• (ii)ie[n] - describing the multiplication from the right for each sequence 
element 

• e - describing the subgroup of field automorphisms 
in the following way: The group Inn is the subgroup of 



GL^)(g)%nGLi;;;(g)x(F;)'^) ><(0 

i=l 



containing all elements y(^Ai A2) ' (( )) j with the following prop- 

erties: 



I f'^jh = V J e Fixed i^n) (4) 
Dm^,^^^,^^Ef = (f7,,)j,]^j,^], V z e Fixed (^c) (5) 



The integer t is well defined by the rank of V ^. — f,,^ Furthermore, the 
inner minimization ensures that this matrix is in reduced row echelon form. 
Similarly, the integer ti is well-defined by the rank of the submatrix (f/j)[j] ^ 
consisting of the first t rows of Ui. The inner minimization produces a special 
structure of these matrices, i.e.: (f/j)[(] ^ = ((f^i)[t] [u]^ o) where (?7i)[t] [u] ^ F*^** 
is in reduced column echelon form up to scalars. 

The partition p is equal to the finest partition whose cells contain the sup- 
ports of all vectors Vj,j G Fixed and the supports of the columns of 
(^i)[t] [ti]'"^ ^ Fixed (*Pc)- The exponent e is equal to the least positive power 
of the Frobenius automorphism which fixes all entries of Vj,j G Fixed 
and (f/,)[j] [j^pi e Fixed (q?c)- 

Corollary 21. For A G GL^*''^''(g) there exists a group element 



{A,B{A),b{A),T^) G Inn. 
Furthermore, {Ik, Is{i), ■ ■ ■ , Is{n), t^) is an element of Inn. 
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5.3.2 Inner Minimization Process 



In the following, we are going to describe the inner minimization process in de- 
tail. Let tt) be the partition of the predecessor node and (crvr, 9i) the actual 
node. For simplicity we suppose that the permutation a was already applied to 
the sequence {U^'^'''\ V'CP''^)) and that we have to compute (f/(^''^'^), l/C^.'^'^)). 

In the case that Fixed = Fixed there is nothing to do. Otherwise, 
we successively modify the elements of the sequence corresponding to the 
indices i G Fixed (9^) starting from those components corresponding to the 
hyperplanes. Additionally, we have to give rules how to change Inn in each step 
in order to guarantee that the i-th entry of the semicanonical representative 
does fulfill Equations ^ and ([s]). 

The procedures of the next two paragraphs are converted from the algorithm 
for linear codes, see jH Algorithm 1]. 



5.3.2.1 Increased Rank Suppose that the normal vector Vj,j G Fixed 
Fixed m^n) contains some nonzero entry in the set . . . , k}. In this case we 

can perform some elementary row operations in order to map Vj to the unit vec- 
tor Ct+i, i.e. there is some matrix A = (0^2) ^ GL^*'''^ such that Avj = Ct+i- 

In particular, applying the group element {A~^^ , /^(i), . . . , Is{n), l'^, t°) G Inn 
leads to this result. 

The new stabilizer can be described by t + 1 and the partition pU{{t + l}}. 
Condition Q ensures that this vector can not be changed anymore. 



5.3.2.2 Same Rank In the second case, the support of the newly fixed 
normal vector Vj,j G Fixed (fH^) \ Fixed is contained in the set [t]. In 

this case we can use each block P G p : P fl supp(uj) 7^ in order to map the 
nonzero entry {vj)i with i := max (P fl supp(fj)) onto If,. This is done using 
the simultaneous multiplication of all rows indexed by i' G P with {vj)~'^ which 
corresponds to the multiplication by the matrix A := (o'/fe'it) ^ GL^*'''^(g) 



with -Dj'^j' = (vj)^^ for i' G P. Corollary 21 ensures that we find a group 
element having the necessary GLfe(g)-component A"^ ^ . Furthermore, we can 
choose such a group element such that b{A^ = 1. 

Finally, we may use the remaining field automorphisms {{Ik, Is{i), ■ ■ ■ , h{n), 1^, t'^)) < 
Inn to minimize the remaining nonzero, non-identity entries of the vector Vj 
starting from the highest index. 

In the following, we are not allowed to multiply two different nonzero ele- 
ments of this vector by different units since we are only able to revert these 
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multiplications by the multiplication of the whole column with the same el- 
ement bj. Hence, the new stabilizer can be described by t and p' := {P G 
p I P n supp(fj) = 0} U {Upep:Pnsupp(i)j)7^0 -P}- The stabilizer under the field 
automorphisms can be expressed by the smallest multiple e' of e such that 

Example 22 (Example [l9| continued) . Suppose that the target cell selection 
told us to split the last cell {12, 13} G £Ho in the individualization step. Suppose 
we are in the branch of the backtracking where we applied the identity element. 
The refined partition contains two singletons {12} and {13}. Minimizing 
Vg yields the representative 
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and the inner stabilizer with parameters t = l,p = {{1}}. The horizontal line 
shows the parameter t, which decomposes the matrices Ui into two subma- 
trices. Similarly, we perform some elementary row operations on the second 
fixed coordinate fio, leading to 
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and the inner stabilizer parameter t = 2,p = {{1}, {2}}. 

Suppose that we would also have to minimize the entry f 7 = (2, 1, 0, 0)"^ in 
the next step under this stabilizer. Then we would be able to minimize f 7 by 

the multiplication with the matrix A := ^ [] ^ ^ ^ ? we would apply a group 

element 



{A~'^,B{A~'^),b{A-'^),T'). 
The stabilizer would be modified such that the GL4(3)-component would be 
equal to Glf ^^'^^^(3). 

For the minimization of the sequence elements Ui of U we distinguish two 
cases. The first is, that we fixed at least one normal vector of a hyperplane 
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whose minimization led to an increased parameter t, see Subsection 5.3.2.1 In 
this case we have to update all matrices Ui for i E [n], i.e. we have to perform 
the following procedure: 



5.3.2.3 Increasing rank for the sequence of newly fixed normal 
vectors In this case, we know that theGL^- component Il Qi^^ {Inn) of Inn 

u)T I I r^" 

is a subgroup of GL^ after finishing |5.3.2.1 Proposition 20 ensures that 
these matrices stabilize {Ui)^^-^ ^ up to scalar multiplications of the rows. Hence, 
we can use the action from the right in order to map {Ui)^^-^ ^ onto its well- 
defined reduced column echelon form RCEF . In fact, it is sufficient to 
produce the reduced column echelon form up to multiplications of the columns 
by elements in F*. This is due to the fact that at this point there is no decision 
on the ordering of the elements of those cells of 9^c which are not singletons. 

This canonization corresponds to the missing third homomorphism of group 
actions: 



{U,V) ■. = [[GLf>{q) X GLi;i;^^(g)jRCEF(Rows,(f/.))j^^^^j 
: Inn ^ (oUigf x H GL,(,)(g)^ x Aut(F,) 

Remark 23. The mapping G(^')(t/, V) ^ Can^^L.terxnti GL.(,)(q))>.Aut(F,) i^i^, 
is an S'tp^ -homomorphism, but it is not S'sp^ -invariant. Therefore, we also could 
derive a refinement of in the application of this minimization. Neverthe- 
less, we included this refinement in the inner minimization process, since it 
allows us to reduce the inner stabilizer, too. 



Example 24 (Example 22 continued). Since the minimization of the normal 
vectors changed the parameter t of the inner stabilizer, we have to perform 
this step for each i E [3], resulting in U equal to 



^1 0|0 0^ 
oil 
1|0 1 

^0 olo 1 I J 

Since we want to preserve the (2 x 1) upper left submatrices of Ui and U2 
up to row and column multiphcations, we choose the parameters (^1,^2,^3) = 
(1, 1, 0). This also leads to a refinement of the cell {1, 2}. 



26 



Finally, in a last step we guarantee the minimality of {Ui)^^-^ ^ for those indices 
i G Fixed (iHc)- We use the methods described in the following in order to 
guarantee that (t/j)^ i is minimal and unchanged. 



5.3.2.4 Singletons This procedure is applied to all i G Fixed (9ic)- We 
can prove that the set of matrices we are allowed to apply on this component 
via a multiplication with the transpose from the right is equal to the su bgroup 
^'^^s{i)'^ for some partition pi of [tj]. The minimization in Subsection 5.3.2.3 
guarantees that the submatrix {Ui)^^^ decomposes as = [ti]^ O) 

where {Ui)^^-^ [^.j is up to scalars a (t x tj)-matrix in reduced column echelon 

form. The action of GL^*''''' from the left and GL^*^'^''*-* from the right only 
multiplies rows or columns of (f/j)[f] [^.j by nonzero entries of the field ¥q. 
We can easily compute the smallest possible image of {Ui)^^-^ j^.] under this 
simultaneous action by examining the operation column by column (the same 



algorithm as for the normal vectors, cf. 5.3.2.2). Similarly, in a subsequent 
step, we use the remaining automorphisms of the field, i.e. the group (r^) for 
the minimization of the nonzero entries. 

Example 25. We give a bigger example over Fig. Let ^ G Fig be a primitive 
element and suppose we have to minimize 



Ui-.-- 



( e o\ 



^8 



under the action of the inner stabilizer, whose rL5(16)-component is defined 
by t = 4,p = {{1,2}, {3}, {4}} and e = 1. The index i G [n] should be a 
newly produced singleton of the actual partition *p and we already produced 
a reduced column echelon form (up to scalars) by the methods in the previous 
paragraph. 



First of all we must minimize the first column leading to U j using the diagonal 



matrix with diagonal entries (C^,^'') 
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The Frobenius automorphism stabihzes all entries of the newly produced col- 
umn. In the following, we are only allowed to apply diagonal matrices which 
are constant on the cells {1,2,4} and {3}. In the next step, we minimize the 
second column using those row multiplications. In order to map the entry e 
to 1 we use the simultaneous multiplication by (^^,^^,1,^®). Note that we 
can revert this multiplication on the first column via the multiplication of the 
whole column with e ■ Furthermore, we multiply the third row by e ^ lead- 
ing to Ui. Finally, we use the Frobenius automorphism to minimize the only 
nonzero and non-identity entry e^ to see U-. The rL5(16)-component of 
the inner stabilizer is defined by t' = 4, p' = {1, 2, 3, 4} and e' = 2. 



5.4 Refinements 



One of the most crucial tasks in the algorithm is the pruning of subtrees. 
We have already mentioned how the group of known automorphisms could be 



used for this task. Furthermore, Subsection |5.3| allows us to prune subtrees 
based on the semicanonical representative of the node. Since those mappings 
heavily depend on Inn^^''^\ they usually will have poor performance on the 
first levels of the search tree. The same holds for the homomorphism we are 
going to introduce in the next subsection. 



5.4-1 Inner Minimization Refine 



Similar to Remark 23 the mapping 



^Subset . G(«i)7r(t/,V) ^ (Subset [t](supp(t;j'P'"^)) 



J6[ 



where Subsets (X) := ' ' ^ ^ for X C [/c], defines an Sm^-homomorphism. 

' 0, else 
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Furthermore, we can predict the result of the inner minimization for the un- 
fixed normal vectors and use this is as an Ssp^-homomorphism as well: 



e 



rain/H 



I— 7- I mm 

(Ar«)eGL(*'>"(g)>.(r'=> 



This function is easily computable. In the case that 6^^^^'^^{{U^V))j = we 
know that O'^^'^^U, V)j = et+i- Otherwise, we can use the methods described 



in Subsection 5.3.2.2|in order to compute the smallest possible representative. 



We can make similar computations on the positions i G [n], i.e. we can define 
the S'qjg-homomorphism 6'™™''' with 



mm 

iD,E,T'')G 
(GLf''"{g)xGL<*')(g))M(r'=> 



Again, this function is easily computable using the methods described in Sub- 
section I5.3.2.4[ 



5.4-2 Colored Incidence Graph 

Associated with the semicanonical representative (U^'^'^\ yPP.'^)) of a node is 
the bipartite Graph G with vertex set [n + h] and edges 



{^,J} I ^e[n],Je {[n + h] \ [n]) : vfj:'>^ui'^'''^ = o] . 



Using the partition *p we may cell-wise count the neighbors of a vertex u G 
[n + h] which defines an S'rp-homomorphism. 

Finally, we have the possibility to color the edges of this graph as well. There- 
fore, we investigate the result of Vj^''^^ Uj;'^''^^ ^ under the action of the inner 



stabilizer. For some arbitrary A G GL'^^ ''^\qy , B 
¥* and G (r^) we have 




GGLl*;5(g),6,G 



/ 



E Bi 
Bo 



^ 



(q3,7r)^^(q3,7r) 



E Bi 
Bo 
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Now, substitute vf'""^ Uf^'^' = (^1,^2) with wi G F*» and W2 G ¥f^-*\ The 
action of Inn^'^''^^ changes the result of this product as given in the equation 
above. Hence, we can distinguish the edges (introduce colors) based on the 
orbits of {wi,W2) under the group action of (^GL^*^j x F*^ x (r"^). 

Canonical representatives of these orbits could be easily computed using the 
following observation: In the case that W2 7^ 0, we are able to find some matrix 
B to map the vector {wi, W2) onto the (ti + l)-th unit vector. In the case that 
W2 is equal to 0, we observe that ii^ is a diagonal matrix, hence the support of 
the vector wi is fixed by the application of this matrix. 

Again, cell-wise counting of neighbors distinguished by the coloring of the 
edges defines an S'tp-homomorphism. The condition ([s]) for the positions i G 
Fixed m^c) furthermore restricts the diagonal matrices E even more. In this 
case, it is even possible to give a refined coloring on the edges which allows 
the definition of a stronger Sip-homomorphism. 

Example 26. Suppose that the partition *p of the actual node contains the 
cells {1, 2} and {1 + n, 2 + n}. Furthermore, the inner stabilizer is defined by 
t = 3, p = {{!}, {2, 3}} and e = 2. The example should be over F4 with k = 6 
and 



{Ui,U2,Vi,V2) = 
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We first give the results of the multiplication vJUf. 





i=l 


i=2 


j=i 


(i,e',o,o) 


(1,1,1, a 


j=2 


(i,e,o,i) 


(i,e,o,o) 



There is exactly one entry in each column and each row whose W2 component 
is equal to zero. The support of the corresponding Wi'''^ part is in both cases 
equal to {1,2}. Hence, the observation of the vertex- and edge-colored graph 
would not lead to refinement in this case. 
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In contrary, if the partition p would be {{1,2,3}} instead, we would observe 

that the matrices which are multiplied from the right to the elements Ui and 

U2 under the action of the inner stabilizer must be elements in GL4 ' (4). 

Hence, the orbits under this restricted action would be {(1, 0, 0), (^, 1, 0, 0), (^^, ^, 0, 0)} 

and {(1,^,0,0), (^,^2^ 0,0), (^^ 1,0,0)}. Therefore, we would be able to color 

the edges (1, 1 + n) and (2,2 + n) differently. This leads to a refinement of 

both cells. 



5.4-3 Iterative Refinements 

In the case that one of these refinements leads to a new singleton in the 



partition we use the inner minimization procedure of Section |5.3| in order 
to get some smaller group Inn^^'^\ The result of this minimization is compared 
with the candidate for the canonical form to prune the tree at an early stage. 

In the case that the inner minimization leads to a smaller group Inn^^''^\ we 



use the refinements described in Subsection 5.4.1 immediately after the inner 
minimization procedure. 



Since the refinement based on Subsection 5.4.2 is the most expensive, we try to 
avoid its application as long as possible. In the case that all other refinements 
fail and that we have updated Inn^'^'^^ by some smaller group or that we have 
replaced the partition *p by a finer partition since the last call of this function, 
the incidence graph may provide a refinement. 

Rather than computing the S'tp-homomorphism for all indices i & [n + h] at 
once, we compute the result iteratively for each pair P G Q G of cells 
and call the refinement after each step. The ordering of the cells in this regard 
should be determined based on the information of all previous steps and we 
are not yet sure about the optimal strategy. 



5.5 The algorithm 



Algorithm [T] gives a recursive description of the backtrack tree generation and 
Figure [2] visualizes this process. As we already mentioned, this tree is tra- 
versed in a depth-first search approach. Therefore we store some candidate 
for the canonical form in a global variable (^Can^ ^Can^_ This is the element 
which compares less than all other leaf nodes already visited including all 
comparisons performed on the paths to these nodes. At the end, this candi- 
date is defined to be the unique representative of the orbit. If this variable is 
uninitialized (= NIL), the leaf node which will be visited next becomes the 
candidate for the canonical form. Two further global variables A, T maintain 
the group of known automorphisms, which could be used for further pruning 
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the tree. The subroutine TargetCell chooses the target cell in this step. 
Similarly, RefinementFunc defines the next Sfpz-homomorphism which has 
to be applied. This function also may return NIL which indicates that the 
refinement process should finish. 

The function InnerMinimization implements the inner minimization as de- 



scribed in Section 5.3 In the case that the result of the minimization (?/', V) 
is smaller than (f/^, ^/Can-j _^ NIL, this function sets the global variable 
(jjCa.n^ yCa.n^ A^JL. On the othcr hand, if there is a smaller candidate for 
the canonical form, the function returns isJeq = false. Otherwise, this flag is 
set to true. Similarly, Refinement implements the Homomorphism Princi- 
ple for some given S'tp'-homomorphism /. It updates the variables in the same 
way. Furthermore, it also calls the InnerMinimization function in the case 
that a further singleton appeared in ^i'. 

During the backtracking it is not necessary to maintain a group element 
{A,B,b,T'') e G(^^), which maps the root node to the semicanonical represen- 
tative of the actual node. The permutation tt G Sn+h and the corresponding 
path to the leaf (D,7r) define an element (A^, 5^, 6^, r'**"') G G^'^) which 
maps the initial sequence {U, V) of the root node to the semicanonical repre- 
sentative of this leaf. The element {A^'^\ B^^\b^'^\T°''''''') is well defined up to 
the multiplication by Inn^^''^^ from the left and it is only computed for some 
few leaves. Furthermore, since we are only interested in a canonization map 
for C, we may restrict this computation to its rLfc(g)-component (^A*^'^^ r"^"'^ . 

Let TT^^'^ G Sn+h be the permutation leading to the canonical form and let 
(Ji, . . . , (T^ G Sn+h define generators of the automorphism group Aut used for 
pruning the search tree. The canonical form of CanrLfe(g)(C) is defined to be 
the sequence of subsets of subspaces given by the column spaces of Up^'^. A 

transporter element TRrLfe(g)(C) is given by ^A*^'''^'"'^ r"'" 



Proposition 27. With the help of the elements r"'" "''j, i G [z], 

the group Inn^^''^'^ ^ and ^A^'"'^'"'^\ r"'^'' we are able to compute generators 
of the automorphism group Aut(C) < rLfe(g).- Aut(C) is generated by 



and <! fA(-"^"),r'^<^^^"M ^ f A^^"^"-), r«'^""-M I z G \z 



Note that (2), 7r*-^^°(jj) may not appear as a node of the pruned search tree 
because of the pruning based on the group of known automorphisms. In this 
case, we still know in which order we have to apply the methods from Sec- 
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Algorithm 1 BACKTRACK 



Require: global variable {U'^^'^, V"^^^) - candidate for the canonical form 
Require: global variable vr^'^'^ - the permutation leading to the candidate 
Require: global variable Aut < Sn+h ^ the group of known automorphisms 
Require: global variable T C Sn+h ^ a left transversal of Aut in Sn+h 
Require: vr, U, V) a node of the backtrack tree 
Ensure: individualization-refinement step on vr, U, V) 

1: procedure Backtrack(*P, tc, U, V, Inn) 

2: if *p is discrete then 

3: if (f/^^", 1/^^") = NIL then 

4: (f/^^^, 1/^^^, TT^^") ^ {U, V,7i) > a new candidate 

5: else 

6: Aut ^ (^Aut, Tr^-^Tv'^^^'^ t> a new automorphism 

7: T ^ left transversal of Aut in Sn+h 

8: end if 

9: return 



10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 



23 
24 
25 
26 
27 
28 
29 



end if 

P ^ TargetCell(*P, vr, Inn), m min(P) > target cell selection 
^' ^ \ P) U {{m}, P \ {m}} 

for j G P do > individualization 

t ^ (m,j), 7^' ^ ivr, iU',V') 4- 

(is Jeg, vr', t/', 1/', /nn')) ^ InnerMinimization(<P', tt', t/', 1/', /nn) 
/ ^ RefinementFunc(*P', /nn') > choose an S'lp'-homomorphism 
while isJeq and / ^ NIL do 

if T n Srp/TT = then > see 14* Lemma 5.9] 

return 
end if 

{isJeq,i^',7r',U',V',Inn')) ^ 
Refinement(/, vr', U', V, Inn') 
22: / ^ RefinementFunc(*P', Inn') 

end while 
if not isJeq then 

return > the actual candidate {U'^^^, V'^^^) is smaller 

end if 

Backtrack (<p', vr', U', V, Inn') 
end for 
end procedure 



tion 5.3 to compute its semicanonical representative, since it defines the same 



canonical form and hence the inner minimization has to follow the same rules 



than the computation of ^A^'^'^'"^\ t""^"" 



Theorem 28. The mapping C h-?- (^CanrLfe(g)(C), TRrLj.(g)(C), Aut(C)j, where 
CanrLfc(g) and TRrhkig) ^'^^ defined as above, solves the canonization problem 
and Algorithm\l\ is a practical algorithm to compute the data. 



33 




TT G 5,, X Sh 



P = TargetCellPP, it. Inn) 



Fixed (qj) 



k= (m,j) 



InnerMinimization 



I 



Refinement <■ 



I 



InnerMinimization 
I 
I 



Fixed (<K) 




compare 
results: 
prune 
the tree 
or reset 

^^Can yCan^ 

if nec- 

cessary 



individualization 
■■4 







1 
1 

. . • InnerMinimization 
1 















Refinement 



InnerMinimization 



Fixed m) 



Candidate 
the canonical form ^ 



6 Applications 



Fig. 2. Backtrack tree generation 



In we gave running times for tlie computation of the automorpliism groups 
of a family of almost perfect nonlinear (APN-) function f^'^^ : F2 — )■ F2, x t— )■ x^. 
These computations were done using a reformulation as a linear code Cj"^^ C 
Fn'' of dimension 2d+l. 



Definition 29. A set C of ci-dimensional subspaces in Pg(A; + 1) with \C\ = 
1 + (g'^"^^ — l)/(g — 1) is called a — l)-dimensional dual hyperoval if the 
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Table 1 

Running times for cj-'^^ compared to c'^p for /(a;) = 

intersection of any two distinct elements of C is a point and any three have an 
empty intersection. 

[H] gives a construction of id — l)-dimensional dual hyperovals d^p in P2(2(i) 
usmer quadratic APN-functions f'^^ : F^. A gain we use the quadratic 

function a; i— )■ to produce a family of subsets to test our algorithm. By 

[3] we know that the automorphism group of dp and are identical for 
(i > 4. This allows us on the one hand to test the algorithm for correctness 
and on the other to compare its performance with the algorithm for linear 
codes. Table [6] shows the running times for different on a single core of a 2.4 
GHz Intel Quad 2 processor. 



7 Conclusion 

This works presents a practical algorithm which solves the canonization prob- 
lem for sequences of subsets of Pg(/c). From the reduction to the graph iso- 
morphism problem, we know that we could not expect to give an algorithm 
that runs in polynomial time. 

The algorithm itself relies on many heuristics, for instance the choice of the 
target cell, the choice of the homomorphism of group actions which has to be 
applied next or when to stop the refinements since they are more expensive 
than performing an individualization step. These problems are well-known 
from the canonization of graphs where possible modifications of the basic 
algorithm are discussed in several papers. 
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Similarly, the improvement of these heuristics is still part of our current re- 
search and we are not yet sure about an optimal strategy. This is also the 
reason of not giving the full implementation details in this regard. 
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